Skip to main content

The Dataset Collection

The Dataset Collection consists of large data archives from both sites and individuals.



rss RSS

9,784
RESULTS


Show sorted alphabetically

Show sorted alphabetically

SHOW DETAILS
up-solid down-solid
eye
Title
Date Archived
Creator
Academic Data and Datasets
Academic Data and Datasets
collection
689
ITEMS
4,262
VIEWS
collection

eye 4,262

A collection of datasets and data related to academic issues.
Academic Torrents
Academic Torrents
collection
2,266
ITEMS
733,604
VIEWS
by ACADEMICTORRENTS.COM
collection

eye 733,604

Welcome to Academic Torrents! Making 14.15TB of research data available. We've designed a distributed system for sharing enormous datasets - for researchers, by researchers. The result is a scalable, secure, and fault-tolerant repository for data, with blazing fast download speeds.
C.elegans behavioural database
C.elegans behavioural database
collection
64
ITEMS
47
VIEWS
collection

eye 47

This experiment is part of the C.elegans behavioural database. For more information and the complete collection of experiments visit http://movement.openworm.org
Dumps of DISCOGS.ORG Metadata (2008-Present)
Dumps of DISCOGS.ORG Metadata (2008-Present)
collection
146
ITEMS
6,482
VIEWS
by DISCOGS.ORG
collection

eye 6,482

This is an unofficial mirror of the DISCOGS.ORG data collection, which is located at http://www.discogs.com/data/ . Discogs, short for discographies, is a website and database of information about audio recordings, including commercial releases, promotional releases, and bootleg or off-label releases. The Discogs servers, currently hosted under the domain name discogs.com, are owned by Zink Media, Inc., and are located in Portland, Oregon, USA. Discogs is one of the largest online databases of...
Harvard Dataverse
Harvard Dataverse
collection
1
ITEMS
619
VIEWS
collection

eye 619

Imageboard Datasets
Imageboard Datasets
collection
88
ITEMS
16,180
VIEWS
collection

eye 16,180

A collection of datasets arranged around imageboards.
Internet Census 2012
Internet Census 2012
collection
15
ITEMS
3,906
VIEWS
by Anonymous
collection

eye 3,906

Abstract While playing around with the Nmap Scripting Engine (NSE) we discovered an amazing number of open embedded devices on the Internet. Many of them are based on Linux and allow login to standard BusyBox with empty or default credentials. We used these devices to build a distributed port scanner to scan all IPv4 addresses. These scans include service probes for the most common ports, ICMP ping, reverse DNS and SYN scans. We analyzed some of the data to get an estimation of the IP address...
MusicBrainz Data Dumps
MusicBrainz Data Dumps
collection
967
ITEMS
8,544
VIEWS
collection

eye 8,544

The MusicBrainz Database is built on the PostgreSQL relational database engine and contains all of MusicBrainz' music metadata. This data includes information about artists, release groups, releases, recordings, works, and labels, as well as the many relationships between them. The database also contains a full history of all the changes that the MusicBrainz community has made to the data. Core data Artists Name, sort name, IPI, aliases, type, begin and end dates, disambiguation comment, MBID...
NIH Data Commons
NIH Data Commons
collection
10
ITEMS
1,627
VIEWS
collection

eye 1,627

The Data Commons Pilot Phase Consortium (DCPPC) is an NIH project to tackle the challenges of data-driven and data-intensive biomedical research: The data sets are too large to download There's minimal interoperability between and across data set providers Local compute capacity often is too limited to meet dynamic research needs These challenges are preventing biomedical data from reaching its full potential in basic research, clinical, and translational medicine. DCPPC aims to improve this...
OpenStreetMap datasets
OpenStreetMap datasets
collection
5,106
ITEMS
31,185
VIEWS
by OpenStreetMap contributors
collection

eye 31,185

OpenStreetMap (OSM) is a collaborative project to create a free editable map of the world. What is available? Planet.osm in XML format (current and full history), dumped weekly Planet.osm in the custom Protocolbuffer Binary Format (PBF) (current and full history), dumped weekly Metadata of all changes (changesets) in XML format, dumped weekly All discussions in XML format, dumped weekly User contributed notes, dumped daily How do I search this collection? The items in this collection are...
Topics: openstreetmap, osm, maps, data, mapping, map, dumps
The Dataset Collection
by iwiftp
data

eye 33

favorite 0

comment 0

IWIFTP archive
Topic: pony
Screenshot Compilations
Screenshot Compilations
collection
3
ITEMS
346
VIEWS
collection

eye 346

Compilations of screenshots generated automatically or semi-automatically.
Unsorted Datasets
Unsorted Datasets
collection
278
ITEMS
121,452
VIEWS
collection

eye 121,452

Unsorted Datasets
YFCC Datasets
YFCC Datasets
collection
13
ITEMS
150
VIEWS
collection

eye 150

Part of an August 2021 download of roughly 40 % of the Flickr images referenced in the YFCC100M dataset.
The Dataset Collection
by IWIFTP
audio

eye 30

favorite 0

comment 0

IWatchItForThePlot / IWIFTP mirror 2020-05-01 These are multipart TAR archives, but every single TAR can be individually extracted. You do NOT need all of them. For further information on how to extract please look here: https://unix.stackexchange.com/a/505773 Disclaimer: These files are only uploaded for archival purposes. Illegitimate use is strictly prohibited.
The Dataset Collection
web

eye 0

favorite 0

comment 0

The Dataset Collection
web

eye 0

favorite 0

comment 0